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Please find below a communication from the EXAMINER in charge of this application 

Commissioner of Patents 

1 . This application contains sequence disclosures that arc encompassed by the definitions for 
nucleotide and/or amino acid sequences set forth in 37 CFR 1 .B2I(aXl) and (a)(2). However, 
this application failsto comply with the requirements of 37 CFR 1,82 1 through 1 ,825 for the * 
reason(s) set fiwffibn the attached Notice To Comply With Requirements ForFalenTAppIicatjons 
Containing Nucleotide Sequence And/Or Amino Acid Sequence Disclosures. 

2. Applicant is given ONE MONTH, or THIRTY DAYS, whichever is longer, from the 
mailing date of this letter within which to comply with the sequence rules, 37 CFR 1.821 - 1,825, 
Failure to comply with these requirements will result in ABANDONMENT of the application 
under 37 CFR i .821(g). Extensions of time may be obtained by filing a petition accompanied by 
the extension lee under the provisions of 37 CFR 1.136(a), In no case may an applicant extend 
the period for reply beyond the SIX MONTH statutory period. Direct the reply to the 
undersigned. Applicant is requested to return a copy of the attached Notice to Comply with the 
reply. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Ginny Portner whose telephone number is (703)308- 
7543. The examiner can normally be reached on Monday through Friday from 7:30 AM 
to 5:00 PM except for the first friday of each two week period. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Lynette Smith, can be reached on (703) 308-3909. The fax phone number 
for this group is (703) 308-4242. 

The Group and/or Art Unit location of your application in the PTO will be Group 
Art Unit 1645. To aid in correlating any papers for this application, all further 
correspondence regarding this application should be directed to this Art Unit. 

Any inquiry of a general nature or relating to the status of this application should 
be directed to the Group receptionist whose telephone number is (703) 308-0196. 



LYNETTE R. F. SMITH 
SUPERVISORY PATENT EXAMINER 
TECHNOLOGY CENTER 1600 



) Application No.: 09/47 )77 

NOTICE TO COMPLY WITh REQUIREMENTS FOR PATENT APPLICATIONS CONTAINING 
NUCLEOTIDE SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

The nucleotide and/or amino acid sequence disclosure contained in this application does not 
comply with the requirements for such a disclosure as set forth in 37 C.F.R. 1.821 - 1 825 for the 
following reason(s): 

j£f 1 . This application clearly falls to comply with the requirements of 37 C.F.R. 1 .821-1 .825 Applicant's 
'T-K attention is directed to these regulations, published at 1 1 14 OG 29, May 15 1990 and at 55 FR 
18230, May 1, 1990. 

□ 2. This application does not contain, as a separate part of the disclosure on paper copv a "Seauence 
Listing" as required by 37 C.F.R. 1 .821 (c). 

3. A copy of the "Sequence Listing" in computer readable form has not been submitted as reauired bv 
37 C.F.R. 1.821(e). M ' y 

Fl 4. A copy of the "Sequence Listing" in computer readable form has been submitted. However, the 
I—I content of the computer readable form does not comply with the requirements of 37 C.F.R. 1 822 
and/or 1 .823, as indicated on the attached copy of the marked -up "Raw Sequence Listing." 

n 5. The computer readable form that has been filed with this application has been found to be damaged 
L-J and/or unreadable as indicated on the attached CRF Diskette Problem Report. A Substitute 
computer readable form must be submitted as required by 37 C.F.R. 1 .825(d). 

□ 6. The paper copy of the "Sequence Listing" is not the same as the computer readable from of the 
"Sequence Listing" as required by 37 C.F.R. 1.821(e). 

Applicant Must Provide: 

|x| An initial or substitute computer readable form (CRF) copy of the "Sequence Listing". 

□ An Initial or substitute paper copy of the "Sequence Listing", as well as an amendment directing its entry 
into the specification. 

JxJ A statement that the content of the paper and computer readable copies are the same and, where 
L - ' applicable, include no new matter, as required by 37 C.F.R. 1 .821(e) or 1 .821 ff) or 1 821(a) or 
1.825(b) or 1.825(d). 

For questions regarding compliance to these requirements, please contact: 

For Rules Interpretation, call (703) 308-4216 
For CRF Submission Help, call (703) 308-4212 
Patentln Software Program Support (SIRA) 

Technical Assistance 703-287-0200 

To Purchase Patentln Software 703-306-2600 



PLEASE RETURN A COPY OF THIS NOTICE WITH YOUR RESPONSE 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Wolf, Marcia K 
Cassels, Fred J 
Boedeker, Edgar C 

(ii) TITLE OF INVENTION: Transformed Bacteria Producing GS6 
Antigens as Vaccines 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Hendricks and Assoc 

(B) STREET: P.O. Box 2509 

(C) CITY: Fairfax 

(D) STATE: VA 

(E) COUNTRY: US 

(F) ZIP: 22031 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/479,877 

(B) FILING DATE: 10-JAN-2000 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Hendricks, Glenna M 

(B) REGISTRATION NUMBER: 32,535 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703/425-8405 

(B) TELEFAX: 703/425-8406 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



AAGCTTGTAA 


CCAGTTGATA 


AAAATATATC 


ACGCTGGGAA 


TGACGTGATG 


TATATACGGA 


60 


GCAGCTATGT 


CGGAACAGAT 


ATTTTCCTAT 


CGGTATGCGT 


TGTGAGTAAG 


CGTAAAGCCA 


120 


ATGCTGTCTG 


TAACTCCTGA 


TCCTTGCAGA 


CTAAATTAGA 


GCTCCTTCTA 


AATTAGACGG 


180 


ATGGATAAAC 


CTACAGACTG 


GCGCTCTGGG 


TCTCGCCGGA 


TATTTTCTAA 


TGAATTTAAG 


240 


CTTCATATGG 


TTGAACTGGC 


TTCGAAACCA 


AATGCCAATG 


TCGCACAACT 


GGCTCGGGAA 


300 


CATGGCGTTG 


ATAACAACCT 


GATTTTTAAA 


TAGCTACGCC 


TCTGGCAAAG 


AGAAGGACGT 


360 


ATTTCTCGTA 


GAATGCCTCC 


AACTATTGTA 


GGCCCTACAG 


TACCACTGAG 


GTAGCCTGAA 


420 


TTTAAAGCCG 


AAGCGGTCAG 


AACTGTTCTT 


GGTGTGAACG 


TAGCACTCAC 


CAATAAAAGC 


480 


ATCAATACGG 


TGCTCTGTTG 


ACACATTACG 


AATGTTATGT 


ATACAATAAA 


AATGATTATA 


540 


GCAATATTAA 


TGGTGTTATA 


TGAAGAAAAC 


AATTGGTTTA 


ATTCTAATTC 


TTGCTTCATT 


600 


CGGCAGCCAT 


GCCAGAACAG 


AAATAGCGAC 


TAAAAACTTC 


CCAGTATCAA 


CGACTATTTC 


660 


AAAAAGTTTT 


TTTGCACCTG 


AACCACGAAT 


ACAGCCTTCT 


TTTGGTGAAA 


ATGTTGGAAA 


720 


GGAAGGAGCT 


TTATTATTTA 


GTGTGAACTT 


AACTGTTCCT 


GAAAATGTAT 


CCCAGGTAAC 


780 


GGTCTACCCT 


GTTTATGATG 


AAGATTATGG 


GTTAGGACGA 


CTAGTAAATA 


CCGCTGATGC 


840 


TTCCCAATCA 


ATAATCTACC 


AGATTGTTGA 


TGAGAAAGGG 


AAAAAAATGT 


TAAAAGATCA 


900 


TGGTGCAGAG 


GTTACACCTA 


ATCAACAAAT AACTTTTAAA GCGCTGAATT ATACTAGCGG 


960 


GGAAAAAAAA 


ATATCTCCTG 


GAATATATAA 


CGATCAGGTT 


ATGGTTGGTT 


ACTATGTAAA 


1020 


CTAAATACTG 


GAAGTATGAT 


TATGTTGAAA 


AAAATTATTT 


CGGCTATTGC 


ATTAATTGCA 


1080 


GGAACTTCCG 


GAGTGGTAAA 


TGCAGGAAAC 


TGGCAATATA 


AATCTCTGGA 


TGTAAATGTA 


1140 


AATATTGAGC 


AAAATTTTAT 


TCCAGATATT 


GATTCCGCTG 


TTCGTATAAT 


ACCTGTTAAT 


1200 


TACGATTCGG 


ACCCGAAACT 


GGATTCACAG 


TTATATACGG 


TTGAGATGAC 


GATCCCTGCA 


1260 


GGTGTAAGCG 


CAGTTAAAAT 


CGCACCAACA 


GATAGTCTGA 


CATCTTCTGG 


ACAGCAGATC 


1320 


GGAAAGCTGG 


TTAATGTAAA 


CAATCCAGAT 


CAAAATATGA 


ATTATTATAT 


CAGAAAGGAT 


1380 


TCTGGCGCTG 


GTAACTTTAT 


GGCAGGACAA 


AAAGGATCCT 


TTCCTGTCAA AGAGAATACG 


1440 


TCATACACAT 


TCTCAGCAAT 


TTATACTGGT 


GGCGAATACC 


CTAATAGCGG 


ATATTCGTCT 


1500 



GGTACTTATG 


CAGGAAATTT 


GACTGTATCA 


TTTTACAGCA ATTAAAAAAA GGCCGCATTA 


1560 


TTGCGGCCAT 


TGACGATACT 


GCTAGGCAAA 


AATATGAAAT 


CAAAGTTAAT 


TATACTATTG 


1620 


ACGTTAGTGC 


CATTTTCATC 


TTTTTCAACA 


GGAAATAATT 


TTGAAATAAA 


TAAGACACGA 


1680 


GTAATTTACT 


CTGACAGCAC 


ACCATCAGTT 


CAAATATCAA 


ATAATAAAGC 


ATATCCTTTA 


1740 


ATTATTCAAA 


GCAATGTATG 


GGATGAAAGC 


AATAATAAAA ATCATGACTT TATAGCAACA 


1800 


CCACCGATTT 


TTAAAATGGA 


AAGTGAAAGT 


CGGAATATAA 


TAAAAATAAT 


TAAAACAACT 


1860 


ATTAATTTGC 


CGGACTCTCA 


GGAAAGTATG 


AGATGGTTAT 


GTATTGAATC 


AATGCCACCA 


1920 


ATAGAAAAAA 


GTACTAAAAT 


AAACAGAAAA 


GAAGGAAGGA 


CAGACAGTAT 


TAATATCAGC 


1980 


ATTCGGGGGT 


GCATTAAACT 


GATATATCGA 


CCTGCCAGTG 


TTCCGTCTCC 


TGTTTTTAAT 


2040 


AATATAGTAG 


AAAAATTAAA ATGGCATAAA AATGGAAAGT ATCTTGTATT 


AAAAAATAAT 


2100 


ACACCCTATT 


ACATTAGCTT 


TTCTGAGGTT 


TTTTTTGATT 


CAGATAAAGT 


AAACAATGCA 


2160 


AAAGATATTT 


TATATGTAAA 


ACCATACTCA 


GAGAAGAAAA 


TAGATATCAG 


CAACAGAATA 


2220 


ATAAAAAAAA 


TCAAATGGGC 


TATGATTGAT 


GATGCTGGCG 


CAAAAACAAA 


ACTTTATGAA 


2280 


TCAATTTTAT 


AAAAAATCTC 


ATTACAGTAT 


ACAAAAACAT 


CAGATTACAG 


GCTTGCTTTT 


2340 


TTTGCTATTT 


ATATATCCTT 


TCTCAACCTC 


ATATGGAAAT 


GAACAATTTA 


GTTTTGACTC 


2400 


ACGATTCCTA 


CCATCAGGTT 


ATAATTACTC 


TTTAAATAGT 


AACTTACCTC 


CTGAAGGTGA 


2460 


GTATCTGGTT 


GATATTTATA 


TTAACAAAAT 


AAAAAAGGAG 


TCCGCGATTA 


TTCCTTTTTA 


2520 


TATAAAAGGA AATAAACTTG TACCATGTTT 


ATCAAAAGAA 


AAAATTTCAT 


CTTTGGGTAT 


2580 


CAACATTAAT 


AATAACGACA ACACAGAGTG TGTAGAAACA AGTAAGGCAG 


GTATTAGTAA 


2640 


TATCAGCTTT 


GAGTTTAGCT 


CTCTTCGTTT 


GTTTATTGCT 


GTACCGAAAA 


ATCTTCTGTC 


2700 


TGAGATTGAT 


AAAATATCAT 


CAAAGGATAT 


AGATAACGGG 


ATTCATGCTT 


TATTTTTTAA 


2760 


TTATCAAGTA 


AATACAAGGC 


TAGCCAATAA 


TAAAAATCGT 


TATGATTACA 


TTTCTGTTTC 


2820 


ACCAAATATA 


AATTATTTTT 


CATGGCGGTT 


GCGTAATCTT 


TTTGAATTTA 


ACCAAAACAA 


2880 


CGATGAAAAA 


ACATGGGAAA 


GAAACTACAC 


TTATCTAGAA 


AAAAGTTTTT 


ATGATAAAAA 


2940 


GCTAAACTTA 


GTCGTTGGTG 


AAAGTTATAC 


GAATTCAAAT 


GTTTATAATA 


ACTACTCTTT 


3000 


TACTGGTATT 


TCAGTTTCTA 


CAGATACAGA 


TATGTATACG 


CCAAGTGAAA 


TCGATTATAC 


3060 


ACCAGAAATT 


CATGGAGTGG 


CTGATTCAGA 


CTCTCAGATT 


ATTGTCAGGC 


AAGGCAACAC 


3120 



CATTATCATT 


AATGAAAGTG 


TTCCAGCCGG 


ACCGTTCTCA 


TTTCCAATAA 


CCAATCTCAT 


3180 


GTATACTGGG 


GGGCAACTTA 


ATGTGGAGAT 


AACAGATATT 


TATGGAAATA AAAAACAATA 


3240 


TACTGTCAAT 


AATTCCTCTC 


TTCCTGTTAT 


GAGAAAAGCG 


GGACTAATGG 


TATATAATTT 


3300 


TATATCTGGG 


AAATTAACAA 


AAAAAAATAG 


TGAGGATGGT 


GATTTTTTTA 


CTCAAGGTGA 


3360 


TATTAACTAC 


GGTACTCACT 


ATAACAGCAC 


ACTATTCGGT 


GGATATCAGT 


TTAGTAAAAA 


3420 


TTATTTTAAC 


TTATCTACTG 


GTATAGGCAC 


TGATCTGGGA 


TTTTCTGGAG 


CATGGCTACT 


3480 


ACACGTTAGC 


AGAAGTAATT 


TTAAGAATAA 


AAATGGATAT 


AATATTAATC 


TACAACAAAA 


3540 


CACTCAGTTA 


AGACCATTCA 


ATGCCGGGGT 


TAATTTCGAT 


TACGCATACA 


GAAAAAAAAG 


3600 


GTATGTGGAA 


CTTTCCGACA 


TTGGCTGGCA 


TGGTAATTTA 


TATAATCAAC 


TTAAAAATAG 


3660 


TTTTTCTTTA 


TCCTTGTCAA 


AATCATTGAA 


TAAATACGGA 


AATTTCTCAC 


TTGATTATAA 


3720 


CAAAATGAAA 


TACTGGGATA 


ATGCGTATGA 


TAGTAACTCA 


ATGTCGATTC 


GTTATTTTTT 


3780 


TAAATTCATG 


CGAGCAATGA 


TTACAACAAA 


TTGTTCTTTA 


AATAAATATC 


AATCTTATGA 


3840 


AAAAAAAGAT 


AAAAGATTTA 


GTATTAATAT 


ATCATTGCCT 


TTAACCAAAG 


ATTACGGGCA 


3900 


CATATCTTCA 


AACTATTCAT 


TTTCCAATGC 


AAATACAGGA 


ACGGCAACCA 


GTTCTGTAGG 


3960 


CTTAAACGGT 


AGTTTTTTTA 


ATGACGCAAG 


ATTAAACTGG 


AACATTCAGC 


AGAACAGAAC 


4020 


GACCCGTAAC 


AATGGATATA 


CTGATAATAC 


CAGTTACATA 


GCAACCAGCT 


ATGCCTCTCC 


4080 


CTATGGCGTT 


TTTACTGGTT 


CATATTCAGG 


ATCGAACAAG 


TATTCAAGCC 


AGTTTTATTC 


4140 


TGCATCGGGA 


GGTATTGTTT 


TGCATAGCGA 


TGGCGTAGCT 


TTTACTCAAA 


AAGCCGGAGA 


4200 


TACCTCTGCT 


CTTGTCCGTA 


TTGATAATAT 


TTCTGATATA 


AAAATTGGTA 


ACACTCCTGG 


4260 


TGTTTATACT 


GGGTATAATG 


GTTTTGCTTT 


AATTCCTCAT 


CTTCAGCCGT 


TCAAAAAAAA 


4320 


CACCATTTTA 


ATTAATGATA 


AAGGAATTCC 


AGACGGTATT 


ACTCTTGCTA 


ATATAAAAAA 


4380 


ACAAGTTATC 


CCATCACGAG 


GAGCTATTGT 


TAAAGTAAAA 


TTTGATGCTA 


AAAAAGGCAA 


4440 


TGACATTTTG 


TTTAAGCTTA 


CAACTAAAGA 


TGGAAAAACG 


CCCCCATTAG 


GAGCTATAGC 


4500 


CCATGAAAAA 


AATGGAAAAC 


AGATTAATAC 


GGGTATCGTT 


GACGATGATG 


GTATGCTTTA 


4560 


TATGTCTGGA 


TTATCAGGGA 


CAGGGATTAT 


TAATGTAACA 


TGGAATGGAA 


AAGTCTGTTC 


4620 


ATTTCCTTTT 


TCAGAAAAAG 


ATATATCTAG 


CAAACAATTA 


TCTGTTGTAA 


ATAAACAATG 


4680 


TTAGGTAGTG 


CATCCAATTA 


GTAGAACATG 


TGTTTTTCGA 


TAAACGCTCC 


GATCTCTTTT 


4740 



TCGTGGATCT CAACTGAGCG TGAGAAGCAG ATTGTTTTAC GAGCCAACCG CTTAATGCGG 
GTGCGTAGCG TCAGATTATT ACGCTCAATG CGTTGGGTGA ATATTTTGCC GGTCAGATGC 
TTATTCTTCG GTACC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GGCCGCATTA TTGCGGCC 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GGCCGCATTA TTGATTGCGG CC 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



60 
120 



300 
360 



(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TTGACACATT ACGAATGTTA TGTATACAAT AAAAATGATT ATAGCAATAT TAATGGTGTT 
ATATGAAGAA AACAATTGGT TTAATTCTAA TTCTTGCTTC ATTCGGCAGC CATGCCAGAA 
CAGAAATAGC GACTAAAAAC TTCCCAGTAT CAACGACTAT TTCAAAAAGT TTTTTTGCAC 180 
CTGAACCACG AATACAGCCT TCTTTTGGTG AAAATGTTGG AAAGGAAGGA GCTTTATTAT 240 
TTAGTGTGAA CTTAACTGTT CCTGAAAATG TATCCCAGGT AACGGTCTAC CCTGTTTATG 
ATGAAGATTA TGGGTTAGGA CGACTAGTAA ATACCGCTGA TGCTTCCCAA TCAATAATCT 
ACCAGATTGT TGATGAGAAA GGGAAAAAAA TGTTAAAAGA TCATGGTGCA GAGGTTACAC 420 
CTAATCAACA AATAACTTTT AAAGCGCTGA ATTATACTAG CGGGGAAAAA AAAATATCTC 480 
CTGGAATATA TAACGATCAG GTTATGGTTG GTTACTATGT AAACTAA 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Lys Lys Thr He Gly Leu He Leu He Leu Ala Ser Phe Gly Ser 
1 5 10 



527 



15 



His Ala Arg Thr Glu He Ala Thr Lys Asn Phe Pro Val Ser Thr Thr 
20 25 30 

He Ser Lys Ser Phe Phe Ala Pro Glu Pro Arg He Gin Pro Ser Phe 
35 40 45 

Gly Glu Asn Val Gly Lys Glu Gly Ala Leu Leu Phe Ser Val Asn Leu 
50 55 6 o 



Thr Val Pro Glu Asn Val Ser Gin Val Thr Val Tyr Pro Val Tyr Asp 

70 75 80 

Glu Asp Tyr Gly Leu Gly Arg Leu Val Asn Thr Ala Asp Ala Ser Gin 

85 90 95 

Ser lie He Tyr Gin He Val Asp Glu Lys Gly Lys Lys Met Leu Lys 
100 105 110 * 

Asp His Gly Ala Glu Val Thr Pro Asn Gin Gin He Thr Phe Lys Ala 
115 120 125 

Leu Asn Tyr Thr Ser Gly Glu Lys Lys He Ser Pro Gly He Tyr Asn 
130 135 140 

Asp Gin Val Met Val Gly Tyr Tyr Val Asn 
145 150 

) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Leu Lys Lys He He Ser Ala He Ala Leu He Ala Gly Thr Ser 
1 5 10 15 

Gly val Val Asn Ala Gly Asn Trp Gin Tyr Lys Ser Leu Asp Val Asn 
20 25 30 

Val Asn lie Glu Gin Asn Phe He Pro Asp He Asp Ser Ala Val Arq 
35 40 45 * 

He lie Pro Val Asn Tyr Asp Ser Asp Pro Lys Leu Asp Ser Gin Leu 
50 55 60 



Tyr Thr Val Glu Met Thr He Pro Ala Gly Val Ser Ala Val Lys He 
65 70 75 80 

Ala Pro Thr Asp Ser Leu Thr Ser Ser Gly Gin Gin He Gly Lys Leu 

85 90 95 

Val Asn Val Asn Asn Pro Asp Gin Asn Met Asn Tyr Tyr He Arg Lys 
100 105 no 

Asp Ser Gly Ala Gly Asn Phe Met Ala Gly Gin Lys Gly Ser Phe Pro 
115 120 125 

Val Lys Glu Asn Thr Ser Tyr Thr Phe Ser Ala He Tyr Thr Glv Glv 
130 135 140 

Glu Tyr Pro Asn Ser Gly Tyr Ser Ser Gly Thr Tyr Ala Gly Asn Leu 
145 150 155 

Thr Val Ser Phe Tyr Ser Asn 

165 



160 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asn Asn Phe Glu He Asn Lys Thr Arg Val He Tyr Ser Asp Ser Thr 
1 5 10 15 

Pro Ser Val Gin He Ser Asn Asn Lys Ala Tyr Pro Leu He He Gin 
20 25 30 



Ser Asn Val Trp Asp Glu Ser Asn Asn Lys Asn His Asp Phe He Ala 
35 40 45 



Thr Pro Pro He Phe Lys Met Glu Ser Glu Ser Arg Asn He He Lys 
50 55 60 

lie He Lys Thr Thr He Asn Leu Pro Asp Ser Gin Glu Ser Met Arg 
GD 70 75 80 

Trp Leu cys He Glu Ser Met Pro Pro He Glu Lys Ser Thr Lvs He 

85 go 9 l 

Asn Arg Lys Glu Gly Arg Thr Asp Ser He Asn He Ser He Arg Gly 
100 105 110 * * 

Cys He Lys Leu He Tyr Arg Pro Ala Ser Val Pro Ser Pro Val Phe 
115 120 125 

Asn Asn He Val Glu Lys Leu Lys Trp His Lys Asn Gly Lys Tyr Leu 
130 135 140 

Val Leu Lys Asn Asn Thr Pro Tyr Tyr He Ser Phe Ser Glu Val Phe 
145 150 155 i 6 o 

Phe Asp Ser Asp Lys Val Asn Asn Ala Lys Asp He Leu Tyr Val Lys 

165 170 175 

Pro Tyr Ser Glu Lys Lys He Asp He Ser Asn Arg He He Lys Lys 
180 185 190 

He Lys Trp Ala Met He Asp Asp Ala Gly Ala Lys Thr Lys Leu Tyr 
195 200 205 

Glu Ser He Leu 
210 

) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asn Gin Phe Tyr Lys Lys Ser His Tyr Ser He Gin Lys His Gin 
15 10 15 

He Thr Gly Leu Leu Phe Leu Leu Phe He Tyr Pro Phe Ser Thr Ser 
20 25 30 

Tyr Gly Asn Glu Gin Phe Ser Phe Asp Ser Arg Phe Leu Pro Ser Gly 
Ja 40 45 

Tyr Asn Tyr Ser Leu Asn Ser Asn Leu Pro Pro Glu Gly Glu Tyr Leu 
50 55 60 

Val Asp He Tyr He Asn Lys He Lys Lys Glu Ser Ala He He Pro 
65 70 75 80 

Phe Tyr He Lys Gly Asn Lys Leu Val Pro Cys Leu Ser Lys Glu Lys 

85 90 95 

He Ser Ser Leu Gly He Asn He Asn Asn Asn Asp Asn Thr Glu Cys 
100 105 110 

Val Glu Thr ser Lys Ala Gly He Ser Asn He Ser Phe Glu Phe Ser 
115 120 125 

Ser Leu Arg Leu Phe He Ala Val Pro Lys Asn Leu Leu Ser Glu He 
130 135 140 



Asp Lys He Ser Ser Lys Asp He Asp Asn Gly He His Ala Leu Phe 
145 I 50 155 leo 

Phe Asn Tyr Gin Val Asn Thr Arg Leu Ala Asn Asn Lys Asn Arg Tvr 

165 170 17 | 




Arg Asn Tyr Thr Tyr Leu Glu Lys Ser Phe Tyr Asp Lys Lys Leu Asn 



215 



220 



Leu Val Val Gly Glu Ser Tyr Thr Asn Ser Asn Val Tyr Asn Asn Tyr 



230 



235 



240 



Ser Phe Thr Gly He Ser Val Ser Thr Asp Thr Asp Met Tyr Thr Pro 

245 250 



255 



Ser Glu He Asp Tyr Thr Pro Glu He His Gly Val Ala Asp Ser Asp 

260 *\ jt XT 



265 



270 



Ser Gin lie He Val Arg Gin Gly Asn Thr He He He Asn Glu Ser 
275 280 285 



Val 290 Ala Gly Pro Phe ™ Phe Pro Ile Thr Asn Leu Met T y r 



295 



Thr 



300 



Gly Gly Gin Leu Asn Val Glu Ile Thr Asp He Tyr Gly Asn Lys Lys 
305 310 315 * 320 

Gin Tyr Thr Val Asn Asn Ser Ser Leu Pro Val Met Arg Lys Ala Glv 

325 330 335 

Leu Met Val Tyr Asn Phe Ile Ser Gly Lys Leu Thr Lys Lys Asn Ser 
340 345 350 

Glu Asp Gly Asp Phe Phe Thr Gin Gly Asp Ile Asn Tyr Gly Thr His 
355 360 365 



370 ** U ™ Phe Ser LyS ASn Tyr Phe 



375 



380 



Asn Leu Ser Thr Gly He Gly Thr Asp Leu Gly Phe Ser Gly Ala 

385 ^r\r\ -i 



390 



395 



Trp 
400 

Leu Leu His Val Ser Arg Ser Asn Phe Lys Asn Lys Asn Gly Tyr Asn 

405 410 



Ile Asn Leu Gin Gin Asn Thr Gin Leu Arg Pro Phe Asn Ala Gly Val 
420 425 430 



Asn Phe Asp Tyr Ala Tyr Arg Lys Lys Arg Tyr Val Glu Leu Ser Asp 



440 



445 



He Gly Trp His Gly Asn Leu Tyr Asn Gin Leu Lys Asn Ser Phe Ser 
450 455 460 

Leu Ser Leu Ser Lys Ser Leu Asn Lys Tyr Gly Asn Phe Ser Leu Asp 



470 



475 



480 



Tyr Asn Lys Met Lys Tyr Trp Asp Asn Ala Tyr Asp Ser Asn Ser Met 

485 490 495 

Ser He Arg Tyr Phe Phe Lys Phe Met Arg Ala Met He Thr Thr Asn 
500 505 510 



Cys Ser Leu Asn Lys Tyr Gin Ser Tyr Glu Lys Lys Asp Lys Arq Phe 
515 520 



525 



Ser Hit ASn Ile Ser ^ Pro ^ Thr As P T yr Gly His He Ser 
530 535 540 

Ser Asn Tyr Ser Phe Ser Asn Ala Asn Thr Gly Thr Ala Thr Ser Ser 
545 550 555 5 6 o 

Val Gly Leu Asn Gly Ser Phe Phe Asn Asp Ala Arg Leu Asn Trp Asn 

565 570 575 

He Gin Gin Asn Arg Thr Thr Arg Asn Asn Gly Tyr Thr Asp Asn Thr 
580 585 590 



Ser Tyr Ile Ala Thr Ser Tyr Ala Ser Pro Tyr Gly Val Phe Thr Glv 

595 ■* 



600 



605 



Ser Tyr Ser Gly Ser Asn Lys Tyr Ser Ser Gin Phe Tyr Ser Ala Ser 
610 615 6 20 



Gly Gly Ile Val Leu His Ser Asp Gly Val Ala Phe Thr Gin Lys Ala 

620 ^*>r* 



630 



635 



640 



Gly Asp Thr Ser Ala Leu Val Arg Ile Asp Asn Ile Ser Asp Ile Lys 

645 650 655 



He Gly Asn Thr Pro Gly Val Tyr Thr Gly Tyr Asn Gly Phe Ala Leu 
660 665 6 7o 

He Pro His Leu Gin Pro Phe Lys Lys Asn Thr He Leu He Asn Asp 
675 680 685 

Lys Gly He Pro Asp Gly He Thr Leu Ala Asn He Lys Lys Gin Val 
690 695 700 

He Pro Ser Arg Gly Ala He Val Lys Val Lys Phe Asp Ala Lys Lys 
705 710 715 1 720 

Gly Asn Asp He Leu Phe Lys Leu Thr Thr Lys Asp Gly Lys Thr Pro 

725 730 735 

Pro Leu Gly Ala He Ala His Glu Lys Asn Gly Lys Gin He Asn Thr 
740 745 750 

Gly He Val Asp Asp Asp Gly Met Leu Tyr Met Ser Gly Leu Ser Gly 
755 760 765 

Thr nil Ile Ile Asn Val Thr Tr P Asn G1 y L Y S Val cys Ser Phe Pro 
770 775 780 



Phe Ser Glu Lys Asp Ile Ser Ser Lys Gin Leu Ser Val Val Asn Lys 

785 *7ftA — — — -* 

Gin Cys 



785 790 795 800 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Arg Thr Glu He Ala Thr Lys Asn Phe Pro Val Ser Thr Thr He Ser 

5 10 15 

Lys Ser Phe Phe Ala Pro Glu Pro Arg lie Gin Pro Ser Phe Gly Glu 

25 30 

Asn Val Gly Lys Glu Gly Ala Leu Leu Phe Ser Val Asn Leu Thr Val 
35 40 45 

Pro Glu Asn val Ser Gin Val Thr Val Tyr Pro Val Tyr Asp Glu Asp 

55 60 

Tyr Gly Leu Gly Arg Leu Val Asn Thr Ala Asp Ala Ser Gin Ser He 

70 75 80 

He Tyr Gin He Val Asp Glu Lys Gly Lys Lys Met Leu Lys Asp His 

85 90 95 

Gly Ala Glu yal Thr Pro Asn Gin Gin He Thr Phe Lys Ala Leu Asn 

105 xiO 

Tyr Thr Ser Gly Glu Lys Lys lie Ser Pro Gly He Tyr Asn Asp Gin 
J.J.O 120 



125 



Val Met Val Gly Tyr Tyr Val Asn 
130 135 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Gly Asn Trp Gin Tyr Lys Ser Leu Asp Val Asn Val Asn He Glu Gin 
1 5 10 



I 



Asn Phe lie Pro Asp lie Asp Ser Ala Val Arg lie lie Pro Val Asn 
20 25 30 

Tyr Asp Ser Asp Pro Lys Leu Asp Ser Gin Leu Tyr Thr Val Glu Met 
35 40 45 

Thr lie Pro Ala Gly Val Ser Ala Val Lys lie Ala Pro Thr Asp Ser 
50 55 60 

Leu Thr Ser Ser Gly Gin Gin lie Gly Lys Leu Val Asn Val Asn Asn 
65 70 75 80 

Pro Asp Gin Asn Met Asn Tyr Tyr lie Arg Lys Asp Ser Gly Ala Gly 

85 90 95 

Asn Phe Met Ala Gly Gin Lys Gly Ser Phe Pro Val Lys Glu Asn Thr 
100 105 no 

Ser Tyr Thr Phe Ser Ala lie Tyr Thr Gly Gly Glu Tyr Pro Asn Ser 
115 120 125 



Gly Tyr Ser Ser Gly Thr Tyr Ala Gly Asn Leu Thr Val Ser Phe Tyr 
130 135 140 



Ser Asn 
145 



